Estimating System Effectiveness Scores With Incomplete Evidence
نویسندگان
چکیده
It is common for only partial relevance judgments to be used when comparing retrieval system effectiveness, in order to control experimental cost. Using TREC data, we consider the uncertainty introduced into per-topic effectiveness scores by pooled judgments, and measure the effect that incomplete evidence has on both the systems scores that are generated, and also on the quality of paired system comparisons. We measure system behavior from three different points of view: the trend in effectiveness scores; the separability of system pairs; and the number of reversals in significance outcomes as the depth of judgments increases. Our results show that when shallow pooled judgments are used system separability remains relatively high, but that there is also a high rate of significance reversal. We then show that explicitly adjusting effectiveness scores to allow for the known amount of uncertainty gives a reduced number of reversals, and hence more consistent experimental outcomes.
منابع مشابه
The Effectiveness of the Automatic System of Fuzzy Logic-Based Technical Patterns Recognition: Evidence from Tehran Stock Exchange
The present research proposes an automatic system based on moving average (MA) and fuzzy logic to recognize technical analysis patterns including head and shoulder patterns, triangle patterns and broadening patterns in the Tehran Stock Exchange. The automatic system was used on 38 indicators of Tehran Stock Exchange within the period 2014-2017 in order to evaluate the effectiveness of technical...
متن کاملINCOMPLETE INTERVAL-VALUED HESITANT FUZZY PREFERENCE RELATIONS IN DECISION MAKING
In this article, we propose a method to deal with incomplete interval-valuedhesitant fuzzy preference relations. For this purpose, an additivetransitivity inspired technique for interval-valued hesitant fuzzypreference relations is formulated which assists in estimating missingpreferences. First of all, we introduce a condition for decision makersproviding incomplete information. Decision maker...
متن کاملخدمات مؤثر جامعهنگر برای مبتلایان به اختلالهای روانپزشکی کدامند؟ مروری بر شواهد
Objectives : The study aimed at reviewing evidence of effectiveness and cost-effectiveness of community-based services for psychiatric disorders. Method: Cochrane and PubMed bibliographic databases and reference lists of relevant studies were investigated. Systematic reviews and controlled randomized clinical trials were analyzed. The inclusion criteria were having effectiveness and cost-effe...
متن کاملMarginal Analysis of A Population-Based Genetic Association Study of Quantitative Traits with Incomplete Longitudinal Data
A common study to investigate gene-environment interaction is designed to be longitudinal and population-based. Data arising from longitudinal association studies often contain missing responses. Naive analysis without taking missingness into account may produce invalid inference, especially when the missing data mechanism depends on the response process. To address this issue in the ana...
متن کاملA New Extended Analytical Hierarchy Process Technique with Incomplete Interval-valued Information for Risk Assessment in IT Outsourcing
Information technology (IT) outsourcing has been recognized as a new methodology in many organizations. Yet making an appropriate decision with regard to selection and use of these methodologies may impose uncertainties and risks. Estimating the occurrence probability of risks and their impacts organizations goals may reduce their threats. In this study, an extended analytical hierarchical proc...
متن کامل